Delving into Sample Loss Curve to Embrace Noisy and Imbalanced Data

نویسندگان

چکیده

Corrupted labels and class imbalance are commonly encountered in practically collected training data, which easily leads to over-fitting of deep neural networks (DNNs). Existing approaches alleviate these issues by adopting a sample re-weighting strategy, is re-weight designing weighting function. However, it only applicable for data containing either one type biases. In practice, however, biased samples with corrupted tailed classes co-exist data. How handle them simultaneously key but under-explored problem. this paper, we find that two types samples, though have similar transient loss, distinguishable trend characteristics loss curves, could provide valuable priors weight assignment. Motivated this, delve into the curves propose novel probe-and-allocate strategy: probing stage, train network on whole without intervention, record curve each as an additional attribute; allocating feed resulting attribute newly designed curve-perception network, named CurveNet, learn identify bias assign proper weights through meta-learning adaptively. The speed meta learning also blocks its application. To solve it, method skip layer optimization (SLMO) accelerate skipping bottom layers. Extensive synthetic real experiments well validate proposed method, achieves state-of-the-art performance multiple challenging benchmarks.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Revised Loss Bounds for the Set Covering Machine and Sample-Compression Loss Bounds for Imbalanced Data

Marchand and Shawe-Taylor (2002) have proposed a loss bound for the set covering machine that has the property to depend on the observed fraction of positive examples and on what the classifier achieves on the positive training examples. We show that this loss bound is incorrect. We then propose a loss bound, valid for any sample-compression learning algorithm (including the set covering machin...

متن کامل

Discrete Polynomial Curve Fitting to Noisy Data

A discrete polynomial curve is defined as a set of points lying between two polynomial curves. This paper deals with the problem of fitting a discrete polynomial curve to given integer points in the presence of outliers. We formulate the problem as a discrete optimization problem in which the number of points included in the discrete polynomial curve, i.e., the number of inliers, is maximized. ...

متن کامل

Delving into Transition to the Semantic Web

The semantic technologies pose new challenge for the way in which we built and operate systems. They are tools used to represent significances, associations, theories, separated from data and code. Their goal is to create, to discover, to represent, to organize, to process, to manage, to ratiocinate, to represent, to share and use the significances and knowledge to fulfill the business, persona...

متن کامل

Feature Curve Co-Completion in Noisy Data

Feature curves on 3D shapes provide important hints about significant parts of the geometry and reveal their underlying structure. However, when we process real world data, automatically detected feature curves are affected by measurement uncertainty, missing data, and sampling resolution, leading to noisy, fragmented, and incomplete feature curve networks. These artifacts make further processi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2022

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v36i6.20661